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We describe a new approach to understanding and calculating magnetization switching rates and 
^ t noise in the recently observed phenomenon of "spin-torque switching". In this phenomenon, which 

. has possible applications to information storage, a large current passing from a pinned ferromagnetic 

■ (FM) layer to a free FM layer switches the free layer. Our main result is that the spin-torque effect 

increases the Arrhenius factor exp(— E/kT) in the switching rate, not by lowering the barrier E, 
but by raising the effective spin temperature T. To calculate this effect quantitatively, we extend 
Kramers' 1940 treatment of reaction rates, deriving and solving a Fokker-Planck equation for the 
energy distribution including a current-induced spin torque of the Slonczewski type. This method 
can be used to calculate slow switching rates without long-time simulations; in this Letter we calcu- 
late rates for telegraph noise that are in good qualitative agreement with recent experiments. The 
method also allows the calculation of current-induced magnetic noise in CPP (current perpendicular 

O , to plane) spin valve read heads. 

C0 " 



£ 

i 

-a 



I. INTRODUCTION 

Recently it has been demonstrated that the magnetization of a thin ferromagnetic film can be switched by passing a 
current between it and a pinned layeri This " spin-torque switching" phenomenon is of interest for possible information 
storage applications. Except at very high currents, the switching appears to be thermal in nature. Previous theoretical 
treatments of thermal spin-torque switching 2,3,4 have been based on the idea that the spin torque increases the 
rate by lowering the effective potential energy barrier, and have encountered a fundamental problem: the common 
Slonczewski 5,6 model for the spin torque is not conservative, so it cannot be described by a potential energy. The effects 
of the Slonczewski torque on the Landau-Lifshitz (LL) equation for the magnetization dynamics are similar to those 
of the LL damping, so in our Fokker-Planck approach it makes a contribution to the effective damping. When this 
' contribution is negative, the effective temperature is raised. The notion of an elevated effective temperature during 
^ . spin-torque switching has been discussed previousljiZi&ii; the present Fokker-Planck formulation allows the precise 
I/"") ' definition and calculation of the effective temperature, which we will refer to as the Maxwell-Boltzmann temperature 
\ (Eq. II 6|l and clarifies the relation between it and the (lower) LL noise temperature. 

The Fokker-Planck equation gives the time evolution of a phase space probability density. It was first applied 
to chemical rate problems in 1940 by Kramers^, who observed that except for very large or very small damping 
constants, the escape rate is well described by an earlier "transition state theory" (TST)li, in which the rate of 
barrier-crossing in a non-equilibrium system is assumed to be the same as that in an equilibrium system. Although 
corrections to TST have been extensively studiedi 2 ^, TST has been found to be the most useful starting point for 
rate calculations. In this Letter we will use a TST- like approximation, differing from the usual TST in that the system 
is not in a true thermal equilibrium, but a non-equilibrium steady state. We will write the magnetic Fokker-Planck 
I ■ equation of Brown 1 ^, generalized to include the Slonczewski torque, but following Kramers^ convert it to describe 
diffusion in energy rather than magnetization; to the best of our knowledge this has not been done previously except 
^ ■ for systems with azimuthal symmetryiii^. 

The LL equation— for the evolution of a uniform magnetization M(<) has a deterministic and a random part: 
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M=^ = Mdct + Mrand (1) 



The deterministic part is divided into a conservative precession term and the dissipative LL damping, and we will 
include also the Slonczewski current-induced torque: 

Mdct = M cons + M LL + M slon (2) 

We will first specify the precession torque: 

M cons = - 7 M x H 

cons (3) 

where 7 is the gyromagnetic ratio. We refer to the field H cons about which M precesses as "conservative" because 
it can be written as the gradient with respect to M of an energy density, 

/i H cons = -V-E(M). (4) 
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(This is a 2D gradient on the M-sphere; see Eq. lAll of refiit) 

Our derivation of the FP equation is valid for a system with arbitrary anisotropy, but for specificity we will consider 
the case of a thin-film element (Fig. Q for which the energy density is given (in SI units) byl& 




FIG. 1: Geometry of thin-film element, for the case where the magnetization rh p of the "pinned" layer is along the easy axis e 
of the free layer. 
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Here H cxt is an external field, Hk is the uniaxial anisotropy field, rh = M/M s , e, and z are unit vectors along the 
magnetization, easy axis, and z axis (perpendicular to the film) respectively, and M s is the saturation magnetization. 
The nonconservative LL damping torque (Eq. |2J) is^£ 



M LL = -7aM s rh x (rh x H cons ) = jaM s H cona 
where a is the dimensionless LL damping constant. The Slonczewski spin-torque^ is 

Msion = -j,JM s m x (rh x rh p ) 



(6) 
(7) 



where J is an empirical constant with units of magnetic field, proportional to the current density, and rh p is the 
magnetization direction in the thicker (often pinned) layer from (or to) which the current flows. 

The effect of the random torque M r and is to produce a diffusive random walk on the surface of the M-sphere. We 
will relate this to a diffusivity D (Eq. I18|) by giving the mean square value of the increment AM ran d = M ran( jAt: 



(AM 2 and ) - LDAi. 



(8) 



The directions of these torques are shown in insets to Fig. from which the basic mechanism of spin-torque switching 
can be seen: the Slonczewski torque pulls the magnetization out of well 1 and allows it to jump to well 2. 




_Hcons ' 


M cons 




M LL Msion 



FIG. 2: Energy contours (Stoner-Wohlfarth orbits) for a thin film, plotted in terms of the coordinates 9 and cj> defined in Fig. 
for the case Hk/M 3 — 0.028, H cxt = 0. The regions i = 1,2,3 are blue, red, and black respectively. The vertical scale is 
exaggerated for clarity. Lower inset: contributions to the rate of change of magnetization for a magnetization in the film plane. 
The insets show the tangent plane: magnetization points out of the paper. Upper inset: the same for an arbitrarily chosen 
direction of M. 
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The Fokker- Planck equation describes the evolution of a probability density p(M, t) on the M-sphere. It can be 
written in the form of a continuity equation^* for p(M, t): 

^> = -V-j(M, f ) (9) 

where the probability current j along the sphere has a convective and a diffusive part: 

j(M, t) = P (M, i)M dct (M) - DVp(M, t) (10) 

(note that both the divergence and the gradient are two-dimensional here). Inserting Eq. ^|into Eq. Ogives the FP 
equation (Eq. IA2l of ref^i) first derived (without the spin torque term) in 1963 by Browni4. 

Frequently, the probability density depends mostly on energy, being constant along an orbit and depending weakly 
on phase around the orbit. This is exactly true in a thermal equilibrium system (even with damping), and we 
show below that it is true in a steady state system with a Slonczewski torque, modeling the telegraph noise system. 
It has often been assumed to be approximately true away from the barrier, to compute non-equilibrium switching 
rateal&tl4il£. The energy dependence may be different in different regions of the sphere (for example, different energy 
wells), so we will define a density p' i {E 1 1), where the region (well) index i = 1 for the tj> = well (for E < E sa d), i = 2 
for the <f> = tt well (Fig. |2J, and i = 3 for E > E sac i- [The three will be equal at the saddle point, where all three 
regions touch.] This density p' is related to pi by 



p(M,t) = pi(E(M),t) (11) 

Kramers derived a Fokker-Planck equation in energy for a particle in a well, but we are not aware of any previous 
derivation for the magnetic case so we will derive it here. Though Kramers used it only in the low-damping limit, it 
is an exact description of the steady state of the system even for high damping. 

The FP equation in energy takes the form of a continuity equation 

7 M s p a (g) dpf^t) d E 

— = -dE h{E ' l) (12) 

where the current jf(E,t) is the number of systems per unit time crossing a constant-energy contour. There is a 
factor on the left hand side involving the orbital period Pi{E) because p' is not the probability per unit energy but 
per unit area on the M-sphere (see Eq. IA23l of ref. 17 ). The current in energy can be obtained from the current on 
the M-sphere (Eq. E3 see Eqs. IA4IA13l of ref. 17 for details): 

jf(E, t) = -jaMsp'iE, t)k(E) + jJM s p^E, t)rh p ■ if - £> MM j,(£) (13) 

in terms of a damping term involving an energy integral over an orbit in the i th well 

lf(E)= $ H coas dM, (14) 



a Slonczewski torque term involving a magnetization integral 

lf(E,t) = <j> dM x M (15) 

and a diffusion term. 

For the telegraph-noise problem we require the steady-state form obtained by setting jf = 0: 

d\np'(E) 7 

g E [ ' = ^ [-<*M. + r,{E)J] ee -Vf3(E) (16) 

where the right hand side defines an effective inverse " Maxwell-Boltzmann" temperature [3(E), and V is the volume 
of the switching element. We have also defined a dimensionless spin-torque-damping ratio r)(E) (Fig. |3J) as the ratio 
of the work of the Slonczewski torque (Eq. [JJ to that of the LL damping (Eq. |6j) 

m = -^*±i. (it) 
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Eq. shows clearly that the Slonczewski torque acts like a correction to the LL damping a. Because r\ has opposite 
signs in the two wells, the damping contribution is negative in one well and positive in the other. A similar result 
has been suggested previously^ for the special case in which rh p is parallel to H ext . 
If J = 0, we get the expected Boltzmann distribution with (3 — 1/ksT only if 



D = jM s ak B T/V; 

this is the fluctuation-dissipation theorem. 

If we integrate Eq. 1161 downward from the saddle point into well i — 1 or 2, we get 



p\(E) =p'(E sad )exp 



V 



[l-rH(E)J/aM s }[E sad -E] 



where the average rj is 



(18) 



(19) 



[E sa d — E] 



rn{E')dE' 



(20) 



In region i = 3 we must integrate upwards from i? sa d (see Eq. IA20I of ref 4£) . The ratio ij and its average rj are 
weakly dependent on E (Fig. 131 , so the distribution is nearly a Boltzmann distribution with an effective temperature 
T/[l-rj(E)J/aM s }. 
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FIG. 3: The current-damping coefficients rji and 772 in the two potential wells, and 773 for the region above the saddle-point 
energy. The values we actually use (near the bottoms of the wells) depend only weakly on the parameters i/ ex t — —120 Oe and 
Hk = 220 Oe, which were estimated from the fit to experimental data (Eq. IA28l of refi-). The averages rj i (Eq. I20|l are also 
shown, as dashed lines. 



We now compute the switching rates using transition state theory (TST). The TST rate is the steady-state proba- 
bility per unit time of crossing a vertical line (4> = tt/2) through the saddle point in Fig. [21 This givesii 

"/M s kBT f . . 

JTST = TTT =T-= . , , R (-Esad) (21) 

PvV[l - r) 3 (E sa , d )J/aM s \ 

From the p^s (Eq. I19|) it is straightforward to obtain the total probability pi of being in each well (Eq. IA25I of 
refii£) . With the absolute-rate-theory current j'tst (Eq. 1211) these determine the dwell times t\ and We will write 
these in terms of a stability factor Si = VE^/hsT (E\ is the barrier height E^ad — Ei, where Ei is the bottom of well 
i = 1 or 2) and a critical current at which the exponent in Eq. 1191 vanishes. J C j = aM s /rji(Ei). Since we do not 
know the exact proportionality factor between the parameter J and the actual physical current / we can write J/ J c , 
as 1 1 I c i, where the critical currents Id should be related by 

icim(Ei) = ic2m{E 2 ) = icsmiEs^) (22) 

Then the dwell times are given byi£ 

" PiC^ji-^fe^-W-ll (23) 



JTST l-I/I c i 
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We define an " Arrhenius-Neel" approximation by neglecting / in the prefactor and the —1: 

r t A ~ N = P l {E l )e s ^~ 1 / 1 ^ (24) 

so that the dwell time is just a straight line on a logarithmic plot of t (Figure 0J). We adjust the two parameters 
S\ and I c i to match the slope and value of the measured^ dwell time at the current / = 4.4 ma at which n and Ti 
cross. In the Arrhenius-Neel approximation, these constants have simple graphical interpretations: I c \ is the current 
at which n intersects the horizontal line at the prefactor Pi (the orbit period), and S\ is the (logarithmic) height of 
the dwell time above this prefactor at zero current. 




FIG. 4: Dwell times as functions of current, illustrating the fitting of the room-temperature data of Urazhdin et ah». Inset 
(upper right) shows the experimental points (+ for n, o for T2) more clearly. The exact results are solid lines, the Arrhenius- 
Neel approximation is dotted. Dotted horizontal lines at the bottom are the orbit periods (Pi on the right, P2 on the left). 
Solid curves that approach them at / = are the prefactor Pi (1 — I/I C 3)/(1 — I /Id), which intersects the exact Ti at Id. We 
use h'i = Icirfi/V3- 

It can be seen from Figure 0] that the experimental data determine I c i quite accurately, within a few percent, 
because we don't have to extrapolate very far from the experimental region to reach the prefactor curve. On the 
other hand, this procedure clearly will not work for determining I C 2, because we are extrapolating from positive to 
negative current, and a tiny change in assumed slope causes a huge change in I C 2- Thus we determine I C 2 from Eq. 
1221 instead, and then adjust £2 to give the right value of T2 at the crossing point. The inset to Fig. 0]shows that this 
gives good semiquantitative agreement with the experimental data. Although we forced the slope of t\ to agree, the 
fact that the slope of T2 is much smaller is a true prediction of the theory. 

In addition to the room temperature data we fit in Fig. Urazhdin et all also measured dwell times at T = 4K. 
We show this data in Fig. 0|but it cannot be fit well by the theory. The reason for this can be seen graphically - 
because the slopes of t± and T2 are similar and fairly large, both will intersect the prefactor line at positive current, 
which is inconsistent with the model. It has been suggested^ that an effective LL noise temperature which is different 
in the two wells (this could be due to Joule heating or spin-wave excitation) could explain this, but the present 
graphical construction suggests that this is not possible - some other mechanism must be involved. 

The theory developed here is also applicable to the calculation of magnetic noise in read headsi^; simulations^ 
of such systems show large, apparently chaotic fluctuations under some circumstances, which are predicted by the 
present theory as / — > I c \ . 
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APPENDIX A: EPAPS SUPPLEMENTARY MATERIAL 
1. Basics of the Fokker-Planck (FP) equation 

The Kramers approach to chemical rate theory was adapted to the magnetic switching problem by Brown' 1 - 4 , who 
wrote a FP equation for a probability density p(M, t) on the sphere of possible values for the magnetization M (its 
magnitude is assumed constant at its saturation value M a ). In magnetic systems, the role of "friction" is played by 
the Landau-Lifshitz damping coefficient a. Physically occurring values of a are low enough (about 0.01 to 0.1) that 
the system nearly follows a constant-energy contour (one of the closed orbits of the undamped system) and there is 
a slow diffusion in energy. Brown and others^ who have used the FP equation have dealt mostly with the case of 
non-equilibrium switching, in which one must specify an initial ensemble with all systems in one of the two potential 
wells, and details of the construction of this initial ensemble can strongly affect the resulting rate. If the damping 
is weak, the rate may be slower than TST due to delay in reaching equilibrium near the barrier (this can lead to a 
rate proportional to the damping coefficient in this limitifli^) It is worth noting that the telegraph noise problem is 
in some ways simpler, since we can deal with a steady-state distribution, and the damping-independent TST rate is 
always a good approximation for physical values of a. 

If a is large, we should use the Gilbert formulation 16 and replace 7 by 7/(1 + a 2 ), in Eqs. Eland El but here we 
take a to be small. 

2. Defining temperatures in a magnetic system 

In a magnetic system one must make distinctions among several different temperatures. Clearly if one puts a high 
current through a nanoscale magnetic element, there is the possibility of Joule heating, making the lattice temperature 
of the element higher than that of the substrate (considered as a heat sink). In the Fokker-Planck equation, another 
temperature is the Landau-Lifshitz noise temperature, related to the diffusion constant D in the FP equation by the 
fluctuation-dissipation theorem (Eq. I18|) . Our steady-state solution of the FP equation allows us to relate this LL 
noise temperature T to the Maxwell-Boltzmann temperature, defined by 1/UbTmb = din p'(E)/dE, where p'(E) is 
the probability distribution in energy, by Eq. ^3 Tmb = T/ [1 — r)(E)J/aM s ] . 

3. Effective field (Eq. HJ) 

The effective field is usually defined by 

H^ ons = H cxt + U K e(th ■ e) - M s z(m • z) (Al) 

but can in fact be defined differently (with the anisotropy field perpendicular to the easy axis instead of along it, for 
example) as long as the component perpendicular to M, which we have denoted by H cons (Fig. |SJ) and which can be 
formally written as H cons = — rh x (m x H^. ons ), is unchanged. The gradient H cons (Eq. defined in the text is just 
the component perpendicular to M of the usual formula (Eq. IA1|) for the field, and the component along M has no 
effect on the dynamics. 




FIG. 5: Magnetization vector for the simplest case of precession about a horizontal external field, showing the M-sphere, the 
tangent plane (perpendicular to the paper) and the projected field H CO ns- The dotted circle is the Stoner-Wohlfarth orbit, a 
curve of constant energy. 

The directions of the various torques in the LL equation are shown in insets to Fig. [3of the text, which also shows 
the contours of constant energy on a planar projection of the M-sphere. For in-plane M (6 = 7r/2), the directions are 
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particularly simple, as shown in the lower inset of Fig. [5] The conservative (precession) term M cons is vertical (along 
the energy contour, i.e., the Stoner-Wohlfarth orbit), the LL damping term Mll is horizontal (along the negative 
energy gradient, i.e., H cons ), and the Slonczewski torque term is horizontal and opposite to the damping term (for 
our choice of sign for the current). 

For a general direction of M, (upper inset) the precession and LL damping torques are still exactly along and 
perpendicular to the orbit, respectively. The Slonczewski torque Msion, on the other hand, can be in any direction, 
depending on the thick-layer magnetization direction. 



4. Derivation of Fokker-Planck equation 



Inserting Eq. ^|for the current j(M,<) into the continuity equation @ gives the FP equation 



-£=DV 2 p-V-(pM de t). (A2) 

This is identical to the FP equation derived by Brownie, though the latter looks more complicated because the 
2D Laplacian is written explicitly in spherical coordinates. It is important that the divergence is two-dimensional 
here. If a three-dimensional divergence is used, it has been shown [J. L. Garcia-Palacios and F. J. Lazaro, "Langevin 
dynamics study of the dynamical properties of small magnetic particles", Phys. Rev. B 58, 14937 (1998)] that the ltd 
interpretation of the LL equation gives an extra (radial) term in the probability current J, relative to the Stratonovich 
interpretation. By working entirely in the spherical surface, we ensure that the ltd and Stratonovich results are the 
same. 



5. Derivation of energy current 



By definition of the 2D current j(M,<), the rate at which systems cross the length element dM. (Fig. |HJ along 
the contour (i.e., Stoner-Wohlfarth orbit) is the component of j perpendicular to the element. Thus the total rate of 

\ 

\ 

\ 




FIG. 6: A length element dM along an orbit, for derivation of the energy-current equation (dU). The plane of the figure is 
the tangent plane to the M-sphere. 



crossing from lower to higher E is an integral over the orbit: 

jf (E,t) = j [j(M, i) x dM] • m (A3) 



Using Eqs. 1 101 and l2l for j, we obtain 



jf{E, t) = I p(M, t)M de t(M) • rh - j [DV >(M, t) x dM] • rh (A4) 

The current from the conservative term (Eq. |3J is along e?M and does not contribute to jf . The current (omitting 
the subscript i since we deal with only one well) then has three terms: 

f{E, t) = j&(E, t) + ji on (E, t) + jg s (E, t) (A5) 
The first (Landau-Lifshitz damping) term comes from the Landau-Lifshitz damping torque Mll (Eq. 0: 

3El(E, t) = ia M s j [p(M, £)H cons x dM] ■ rh = - ia M s p'{E, t)lf(E) (A6) 
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where we have used the fact that the energy is constant over the orbit to bring p (Eq. out of the integral, and 
defined a (positive) energy integral if by 

lf(E) ee j> [dM x H cons ] • m = I H cons dM (A7) 

We obtain the last expression because the three vectors in the triple product are mutually orthogonal. 
The remaining term in the deterministic torque is the Slonczewski torque 

Msion = -7 JA/ S m x (rh x m p ) = -7 JM S [m(m • m p ) - m p ] (A8) 

which gives an energy current 

isLC^i *) = -JJM S I [p(M, t) [m(m • m p ) - rh p ] x dM] • m = ^JM s p'{E, t)m p ■ I [dM x m] (A9) 

since the first term in the cross product is orthogonal to m. Defining the magnetization integral 

lf(E) = <j> dM x M (A10) 

gives 

jLn(E, t) = 1 Jp'{E, t)m p • If (All) 
The last (diffusive) term in Eq. I A4I involves 

V P (M,i) = Wp'(E(M),t) = M V£ (M) = -MMh» s (A12) 



and gives the same triple vector product as the Landau-Lifshitz damping term (Eq. IA6|) : 



t B * _ n dpf(E,t) /\ ^ _ . _ n dp'(E,t) tE . 



j$B(E,t) = D ' j> [H cons x dM] • m = -D ' ^ ' lf(E); (A13) 

Our final result for the total energy current (Eq. IA5(I is thus Eq. Inserting this current into the continuity 

equation (Eq. H2jl gives the energy Fokker-Planck equation. 

Since Kramers used the FP equation in energy only in the low-damping limit, which might suggest that it is an 
approximation valid only in that limit, it is important to realize that it is not an approximation at all - as long as 
the probability density p is constant along an orbit, it gives the exact evolution of p(E,t). 

In a steady state, the energy current (Eq. 1 1 31) vanishes, giving Eq. ^j] 

dlll ^ E) = ^ [-oM. + v(E)J] = -V(3(E) (A14) 

This involves the ratio (Eq. I17|) 

plotted in Fig. [3J We have calculated the integrals by solving the LL equation numerically at each energy; an 
analytic check is possible at the bottom of each well (at the left end of the 771 or 772 curve), where |r?,-(JSj)| = 
M S /[\M S + H K t- #ext] ~ 2. 

6. Solving the FP equation in energy 
We can write the inverse effective Maxwell-Boltzmann temperature defined by Eq. I AMI or ^j] as 

0i(E) = t~t^ = rV 1 ~ Vi(E)J/aM s ] (A16) 
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in terms of which Eq. 1191 can be written 



Pi( E ) = p'(E sad )exp 



(A17) 



Note that p'(E sa d) has no region index because it is the same in all three regions. If f3i(E) is independent of energy, 
Eq. IA17I becomes the usual Maxwell-Boltzmann distribution; in any event it is easy to integrate numerically. 

Current-driven thermal switching can be understood in terms of the increase in the Arrhenius rate due to this 
temperature increase. If we increase the current J enough, the temperature at the bottom of the well can be made 
negative; this can be used to model the onset of microwave noise, although our first-order treatment of J will eventually 
fail and we will need to work in terms of nonzero- J orbits rather than the present unperturbed orbits. 



7. Derivation of TST (transition state theory) rate 



The TST rate is the probability per unit time of crossing the dotted line in Fig. which follows the energy gradient 
from the saddle point. This is independent of the damping coefficient, since the damping current pMll (Eq. El has no 




o 1 (f) 2 3 



FIG. 7: The contour plot of Fig. [2] showing the dashed line across which switching from region 2 to region 1 takes place, along 
which we integrate the current to obtain the total TST rate. 



component normal to this line, nor does the diffusive current —DVp, and the conservative and Slonczewski currents 
are independent of damping. To lowest order in the electric current J, we need only include the conservative current, 
and the probability current (left to right) is given by an integral like Eq. IA4I 



Jtst = 



3 (M) x dM] 



m = — 



x dM 



m 



(A18) 



[Since we are in a steady state, the reverse current (right to left, across a similar line pointing upward from the saddle 
point in Fig. EJ is equal in magnitude.] Using Eq. and observing from Fig. 0that all the cross products involve 
perpendicular vectors, we obtain 



jtst = jM s I p' 3 (E)H cons dM = lMl f p ' 3 (E)dE 
J Po J E ^ d 



(A19) 



where we have used dE — /j,oH cons dM (Eq. |2J. The analog of Eq. ^|for the region E > E sa a where we integrate up 
instead of down from £$ad) is 



p' 3 (E)=p'(E sad )exp 



V 



[1 -%(£) J/aM s ][E-E sad ] 



so that 



lM s , , 

JTST = P {E sa dj 

PO 



exp 



V 



[l-f}a(E)J/aM s ][E-E aad ] 



dE 



(A20) 



(A21) 



Noting from Fig. |3that 773 is slowly varying within fcsT of -E S ad, we approximate it by a constant so the integral is 
analytic: 



(A22) 
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It may seem inconsistent to include a term in J /a when we are working to lowest order in J; however, since a is also 
small, we really work to lowest order in both, and J/a is of zeroeth order in this sense. 

In the chemical reaction rate literature^ it is found that TST gives a good representation of the switching rates 
as long as a trajectory that crosses the saddle point from the initial to the final well is likely to be trapped there by 
losing energy to LL damping. The LL damping constants in magnetic materials are large enough (0.01-0.02) that 
escape after one orbit in the final well is quite unlikely, so the TST is likely to be quite accurate. Our telegraph-noise 
fitting involves rates that vary by many orders of magnitude, so corrections to TST of a few percent are not critically 
important. 



8. Derivation of dwell times 



The total probability of being in well i (= 1 or 2) is obtained by integrating the density over the well. The 
probability of being in a surface element d 2 M on the M-sphere with energy E is p' (E)d 2 M , so the probability of being 
in the ring shown in Fig. [S]is 




FIG. 8: A ring on the M-sphere representing energies between E and E + dE, showing the dimensions dM and dM' of an area 
element. 



p'{E)M cons dtdE/n H cons = jp'(E) I M s H cons dtdE/p H cons = jMsPi ^ P '(E)dE 



/'-o 



where Pi(E) is the period of the orbit. Thus the well probability is (using Eq. IA20I for p' 3 ) 



PA E ) dE = P {E sa ,d) 



/'o 



Po Je. 



-E sa d 



Pi(E)exp 



V 



[\-rh{E)J/aM s }[E saA -E] 



(A23) 



dE (A24) 



where Ei is the energy at the bottom of well i. Again, rji(E) and Pi(E) are nearly independent of E, which is true 
in the lowest few k B T of the well, so this integrates to 



Then the dwell time is 



jMsPjjEi) p'(E sad )k B T 
p V l-rn(E t )J/aM t 

l-M{E sa d)J/aM t 



exp 



V 
~knT 



[l-rji(Ei)J/aM s ]El 



JTST 



= Pi(Ei) 



l-rfi{Ei)J/aM s 



exp 



V 
k^f 



[l-Wi{Ei)J/aM s ]E\ 



- 1 



(A25) 



(A26) 



where E\ = _E sa d — E{ is the barrier height. The important thing to notice is that r\\ and 772 have opposite signs, so 
one dwell time increases exponentially with J and the other decreases, as in Fig. 



9. Determination of energy barriers 

One virtue of the fitting scheme described in the text and illustrated in Fig. 0|is that we can determine the stability 
factor Si = V E\ jksTi of each well independently, where the barriers are given byi^ 

E\ - (It H eXb /H K ) 2 KV/k B Ti (A27) 
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though these are in turn consistent with many choices of anisotropy K = HkM s /2, effective external field H BXt , 
temperature Tj , and volume V. For example, if we assume the wells are at the same temperature and H CKt = —120 
Oe, we obtain 

H K = 220 Oe . (A28) 

If we further assume the experimental estimate^ V = 3.67 x 10 -23 m 3 , we obtain an effective temperature T — 730 
K. (Of course, this could be reduced to room temperature by assuming a smaller effective volume.) 
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